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ABSTRACT 

Criterion-referenced evaluation (ORE) describes 
achievement in performance terms, whereas norm-referenced evaluation 
(NRE) compares the performance of one individual to that of others 
with respect to a given evaluation instrument* Vocational educators 
who base their programs on behaviorism commonly evaluate student 
performance from a CRE perspective. When deciding whether to use CRL 
or NRE to evaluate a given instance of learning, vocational 
instructors must examine their purposes in undertaking the given 
evaluation. In situations when an individual's performance must be 
measured in terms of comparison with a standardized sample, NRE is 
the obvious choice. However, vocational educators must also realize 
the value of CRE in assessing the effectiveness of given 
instructional treatments in achieving specific instructional 
objectives. (MN) 
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INTRODUCTION 

The term criterion-referenced evaluation (CRE) was 

first proposed by Glaser in 1963 (Anastasi, 1982:9A) and is 

used, according to Mehrens and Lehmann (1978:692) 

to provide information on the specific knowledge or 
skills possessed oy a student. • • . [with] meaning 
in terms of what the student knows or can do, rather 
than in their relation to the scores made by some 
external reference group • 

CRE, therefore , "describes achievement in performance terms 

(Gronlund, 1981:416) and is in contrast to norm-referenced 

evaluation (NRE) where "an individual's score is interpreted 

by comparing it with the scores obtained by others on the 

same test" (Anastasi, 1982*:94)* 

Skinner purported direct assessment of target 

behavior since trait names such as aggressive, lazy, and 

retarded fail to refer to any unit of behavior suitable for 

study through functional analysis (Bower and Hilgard, 1981: 

198)» Regarding Skinner, Mehrens and Lehmann (1978:51-52) 

attributed CRE to behaviorism and the 

emphases on behavioral objectives, the sequencing and 
individualization of instruction , the development of 
programmed materials • • • and a belief that norm ref- 
erencing promotes unhealthy competition and is injurious 
to low-scoring-students ' s elf -concept s . 

Vocational instructors who use behaviorism commonly evaluate 

student performance from a CRE perspective. To better under- 

stand the appropriate uses of CRE, it is necessary to grasp 

the context of reliability and validity from a CRE position* 
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RELIABILITY FROM A CRITERION-REFERENCED POSITION 



Reliability is defined by Mehrens and Lehmann (1978: 
700) as "the extent to i:hich a test is cbnsistent in measur- 
ing whatever it does measure," Anastasi (1982:118) identi- 
fied the following as sources of error variance in relation 
to their respective reliability coefficients: time sampling 
( test-'retest ) ; content sampling (immediate alternate-form); 
time sampling and content sampling (delayed alternate-form); 
content sampling (split-half); content sampling and content 
heterogeneity (Kuder-Richardson and coefficient Alpha); and 
interscorer differences (scorer). 

In view of the determination of reliability, Popham 
and Husek (1973:134) have identified how, because there may 
be minimal score variability, the classical procedures that 
are used to assess reliability are not appropriate. In 
contrast to NRE, a CRE test may have not only zero internal 
consistency, but the test may also have negative internal 
consistency and still be a good test, Anastasi (1982:130) 
addressed the issue of CRE reliability and stated that CRE 
test data can be used "provided they are applied to the 
appropriate data. . . . the test scores of persons who have 
had different amounts of prior training." 

VALIDITY FROM A CRITERION-REFERENCED POSITION 

Validity is defined by Mehrens and Lehmann (1978:109) 
as "the degree to which a test is capable of achieving 



certain aims" and is further described as "the single most 
important aspect of a test." Three general types of validity 
have been identified* Anastasi (1982:131) defines content 
validity as "the systematic examination of the test content 
to determine whether it covers a representative sample of the 
behavior domain to be measured." Criterion-related validity 
is identified by Mehrens and Lehmann (1978:703-704) as "the 
extent to vrhich scores on the test are in agreement with 
(concurrent validity) or predict (predictive validity) some 
given criterion measure." Construct validity is described by 
Anastasi (1982:144) as the extent theoretical constructs or 
traits such as intelligence or mechanical comprehension are 
measured. In regard to CRE validity, Anastasi (1982:140) 
stated, "the most satisfactory type of criterion measure is 
that based on follow-up records of actual job performance." 

CONCLUSION AND RECOMMENDATIONS 

Vocational instructors must realize that the decision 
to use either CRE or NRE depends on the purpose of the evalu- 
ation process. CRE, which is used to "assess the effective- 
ness of given instructional treatments in achieving specific 
instructional objectives" (Isaac and Michael, 1981:110), can 
be as equally appropriate as NRE which must consider "the 
test performance of the standardized sample" (Anastasi, 1982: 
65). Although NRE has its appropriate applications, voca- 
tional instructors must also recognize the appropriate uses 
of CRE as they assess achievement of predetermined criteria. 
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